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DETAILED ACTION 

1 . This Office Action is in response to the Amendments and Arguments filed on 
08/18/2008. Claims 1, 6-10, and 15-22 remain pending and have been examined. 
Claims 3 and 12 have been cancelled. The Applicants' amendment and remarks have 
been carefully considered, but they are moot in view of new grounds for rejection. 

All previous objections and rejections directed to the Applicant's disclosure and 
claims not discussed in this Office Action have been withdrawn by the Examiner. 

Continued Examination Under 37 CFR 1.114 

2. A request for continued examination under 37 CFR 1.114, including the fee set 
forth in 37 CFR 1.17(e), was filed in this application after final rejection. Since this 
application is eligible for continued examination under 37 CFR 1.114, and the fee set 
forth in 37 CFR 1 .17(e) has been timely paid, the finality of the previous Office action 
has been withdrawn pursuant to 37 CFR 1.114. Applicant's submission filed on 
08/18/2008 has been entered. 

Response to Arguments 

3. Applicant's arguments (pages 8-10) filed on 08/18/2008) with regard to claims 1 , 
6-10, and 15-22 have been considered but are moot in view of the new ground(s) of 
rejection. 

Response to Amendment 
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4. Applicants' amendments filed on 08/18/2008 have been fully considered. The 
newly amended limitations in claims 1 and 10 necessitate new grounds of rejection. 



Claim Rejections - 35 USC § 103 

5. The following is a quotation of 35 U.S.C. 1 03(a) which forms the basis for all 

obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed 
or described as set forth in section 102 of this title, if the differences between the 
subject matter sought to be patented and the prior art are such that the subject 
matter as a whole would have been obvious at the time the invention was made 
to a person having ordinary skill in the art to which said subject matter pertains. 
Patentability shall not be negatived by the manner in which the invention was 
made. 

6. Claims 1, 8-10, and 17-19 are rejected under 35 U.S.C. 103(a) as being 
unpatentable over Maeda (US PGPub 2001/0048770, 12/06/2001) in view of Date (US 
6,605,768) in view of Ichimura (US 6,1 88, 831 ). 

As to claim 1, Maeda teaches an image and audio processing apparatus 
comprising: 

an input unit configured to input image data and audio data corresponding 
thereto (see Figure 24, MPEG-4 encoded data input unit 2401 and see [0202], 
where image and audio data is associated with the data); 

image encoding unit configured to encode the image data (see page 4, 
[0080], line 19); 
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audio data unit configured to encode the audio data (see page 12, [0204], 
line 1-3) for encoding audio data inputted together with the image data (see page 
12, [0204], line 2); 

encoding of audio data during the period for which audio data (see Maeda, 
page 12, [0204], line 1-3) is processed with high acoustic quality (e.g. This is 
referring to the ROI set by the user) inputted together with the image data (see 
page 12, [0204], line 1-3) (e.g. It should be note that in the reference the audio 
data is appended with the image data. Further, the applicant regards the audio 
setting means as the ROI setting done for the image and since the audio is 
appended it is inherent that the audio data will be affected for specific region. The 
audio data is encoded by a MPEG4 encoder).. 

an image encoding setting unit configured to set (see page 2, [0026], lines 
3-4) for setting the encoding said image encoding unit to encode the image data 
(e.g. Specifying an ROI of the moving image as being set by the reference and 
has been known to be of higher quality) so that a partial region in each frame 
images included in the image data is encoded with a high image quality (see 
page 1, [0007], lines 1-3 and [0249])); and 

an audio encoding setting unit (see page 2, [0026], lines 3-4) configured to 
set an execution of the encoding in said second audio encoding unit in a case 
that said image encoding unit effects the setting of the encoding (see page 12, 
[0204], line 1-3) (e.g. It should be note that in the reference the audio data is 
appended with the image data. Further, the applicant regards the audio setting 



Application/Control Number: 10/656,375 Page 5 

Art Unit: 2626 

means as the ROI setting done for the image and since the audio is appended it 
is inherent that the audio data will be affected for specific region. The audio data 
is encoded by a MPEG4 encoder).; 

data integration unit configured (see Figure 24, multiplexer 2414) image 
encoding setting means (see Figure 24, mask encoder 2412 and [0202], ROI 
shape and position information) and the encoded audio data by said audio 
encoding means (see Figure 24, output of entropy encoder 241 3 and input into 
multiplexer.) corresponding to the period of the frame images encoded with high 
image quality, and outputting the integrated data (see Figure 24, output of 
multiplexer 2414 and input into code output unit 2415.) 

However, Maeda does not specifically disclose the use of two separate 
encoding units for encoding audio data using two separate methods and 
selectively outputting the encoded data. 

Date et al. does teach the first audio encoding unit configured to encode 
the by an audio encoding method for encoding general audio data (see col. 6, 
lines 39-42, different compression method such as MIDI, for musical component) 

second audio encoding unit configured to encode the(see col. 6, lines 35- 
40, CELP) the audio data by another encoding method which is suitable for 
encoding speech data (see col. 6, lines 35-40, vocal components) 

It would have been obvious to one of ordinary skilled in at the time the 
invention was made to have modified the image processing taught by Maeda 
with the multiple coders as taught by Date. The motivation to have combined the 
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two references involves the ability to produce high quality audio (see Date col. 1 , 
lines 49-56)) as would benefit the image processing apparatus taught by Maeda, 
which discloses image and audio data corresponding to the ROI. Further, the 
combination would allow the system taught by Maeda to produce high quality 
audio for the ROI representing the audio and image. 

However, Maeda in view of Date do not specifically teach the integration 
encoded audio data in a case image encoding setting unit does not effect the 
setting of the encoding, and to integrate with the encode image in a case where 
image setting unit effects the setting of the encoding thereby outputting 
integrated encoded data . 

Ichimura does teach 

the integration encoded audio data in a case image encoding setting unit 
does not effect the setting of the encoding, and to integrate with the encode 
image in a case where image setting unit effects the setting of the encoding 
thereby outputting integrated encoded data (see col. 5, lines 32-45, image and 
audio data are stored with high quality for a given interval whereas the other 
intervals are processed as normal.), (e.g. Hence, it would have been obvious to 
one skilled in the art that the combination of Maeda in view of Date in view of 
Ichimura would be integrating audio data with respect to the first encoding unit 
with image when importance is not given consideration for maintaining high 
quality for important sections as opposed to unimportant sections (See Ichimura, 
col. 5, lines 40-46). Further, in regards to the latter limitation of the data 
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integration unit, Date teaches the use of multiple encoders and for detecting and 
compressing based on type of signals (see Date, col. 6, lines 31-47), while 
Ichimura teaches the compression analysis based on important sections and if 
an important section is detected then performing compression to ensure high 
quality ((see col. 5, lines 40-46). The data integration occurs as the image has a 
corresponding audio content that when compression takes place, both the audio 
and image data are accounted for (see Ichimura col. 5, lines 40-46 and Maeda, 
Figure 24, and [0202]). 

It would have been obvious to one of ordinary skilled in at the time the 
invention was made to have modified the image processing taught by Maeda in 
view of Date with the encoding of high quality audio in selected regions of an 
image as taught by Ichimura. The motivation to have combined the two 
references involves the ability to produce accurate playback for important regions 
in a signal (i.e. ROI) (see Ichimura, col. 5, lines 5-8). 

As to claims 6,7,15, and 1 6 Maeda in view of Date in view of Ichimura 
teach all of the limitations as in claims 1 and 10, above. 

Furthermore, Maeda teaches wherein said image encoding setting unit 
makes the setting so as to encode a region (see [0247], ROI is set by the region 
setting unit), with the high image quality (see [0007], ROI is determined for higher 
image quality), including an arbitrary object in the image data (see [0247], user 
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can set a region of interest that includes objects. Further, the ROI is a region that 
is to be encoded (see [0247])). 



As to claims 8 and 17, Maeda in view of Date in view of Ichimura teach all 
of the limitations as in claims 1 and 10, above. 

Furthermore, Maeda teaches wherein said image encoding setting means 
makes the setting so as to encode a partial region of the image (see page 15, 
[0247], line 12-13) (e.g. ROI) data with the high image quality (see Maeda, page 
1 , [0007], lines 1-3) in accordance with a user's instruction for designating an 
object displayed on a display screen (see page 15, [0247], line 14-15) (e.g. User 
can set ROI from the displayed data obtained from a camera.). 



As to claims 9 and 18, Maeda in view of Date in view of Ichimura teach all 
of the limitations as in claims 1 and 10, above. 

Furthermore, Maeda teaches wherein image encoding setting means 
makes the ROI setting image (see page 15, [0247], line 12-13) in accordance 
with the user's instruction (see page 15, [0247], line 14-15), and wherein said 
image encoding means executes the ROI encoding (see page 15, [0247], line 16- 
17). 



As to claims 10 and 19, Maeda teaches an image processing method 
comprising: 
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an input unit step of inputting image data and audio data corresponding 
thereto (see Figure 24, MPEG-4 encoded data input unit 2401 and see [0202], 
where image and audio data is associated with the data); 

image encoding step of encoding the image data (see page 4, [0080], line 

19); 

audio data unit step of encoding audio data (see page 12, [0204], line 1- 
3) for encoding audio data inputted together with the image data (see page 12, 
[0204], line 2); 

encoding of audio data during the period for which audio data (see Maeda, 
page 12, [0204], line 1-3) is processed with high acoustic quality (e.g. This is 
referring to the ROI set by the user) inputted together with the image data (see 
page 12, [0204], line 1-3) (e.g. It should be note that in the reference the audio 
data is appended with the image data. Further, the applicant regards the audio 
setting means as the ROI setting done for the image and since the audio is 
appended it is inherent that the audio data will be affected for specific region. The 
audio data is encoded by a MPEG4 encoder).. 

an image encoding setting step of setting (see page 2, [0026], lines 3-4) 
for setting the encoding said image encoding step to control an image quality in 
accordance (e.g. Specifying an ROI of the moving image as being set by the 
reference and has been known to be of higher quality) with a partial region in 
each frame images included in the image data is encoded with a high image 
quality (see page 1, [0007], lines 1-3 and [0249])); and 
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an audio encoding setting step (see page 2, [0026], lines 3-4) setting an 
execution of the encoding in said second audio encoding unit in a case that said 
image encoding step effects the setting of the encoding (see page 12, [0204], 
line 1-3) (e.g. It should be note that in the reference the audio data is appended 
with the image data. Further, the applicant regards the audio setting means as 
the ROI setting done for the image and since the audio is appended it is inherent 
that the audio data will be affected for specific region. The audio data is encoded 
by a MPEG4 encoder).; 

data integration unit step (see Figure 24, multiplexer 2414) image 
encoding setting step (see Figure 24, mask encoder 2412 and [0202], ROI 
shape and position information) and the encoded audio data by said audio 
encoding step(see Figure 24, output of entropy encoder 2413 and input into 
multiplexer.) corresponding to the period of the frame images encoded with high 
image quality, and outputting the integrated data (see Figure 24, output of 
multiplexer 2414 and input into code output unit 2415.) 

However, Maeda does not specifically disclose the use of two separate 
encoding step for encoding audio data using two separate methods and 
selectively outputting the encoded data. 

Date et al. does teach the first audio encoding step encoding audio 
encoding audio data by an audio encoding method for encoding general audio 
data (see col. 6, lines 39-42, different compression method such as MIDI, for 
musical component) 
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second audio encoding step encoding the (see col. 6, lines 35-40, CELP) 
the audio data by another encoding method which is suitable for encoding 
speech data (see col. 6, lines 35-40, vocal components) 

It would have been obvious to one of ordinary skilled in at the time the 
invention was made to have modified the image processing taught by Maeda 
with the multiple coders as taught by Date. The motivation to have combined the 
two references involves the ability to produce high quality audio (see Date col. 1 , 
lines 49-56)) as would benefit the image processing apparatus taught by Maeda, 
which discloses image and audio data corresponding to the ROI. Further, the 
combination would allow the system taught by Maeda to produce high quality 
audio for the ROI representing the audio and image. 

However, Maeda in view of Date do not specifically teach the integration 
encoded audio data in a case image encoding setting unit does not effect the 
setting of the encoding, and to integrate with the encode image in a case where 
image setting unit effects the setting of the encoding thereby outputting 
integrated encoded data . 

Ichimura does teach 

the integration step of encoded audio data in a case image encoding 
setting step does not effect the setting of the encoding, and to integrate with the 
encode image in a case where image setting unit effects the setting of the 
encoding thereby outputting integrated encoded data (see col. 5, lines 32-45, 
image and audio data are stored with high quality for a given interval whereas the 
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other intervals are processed as normal.)- (e.g. Hence, it would have been 
obvious to one skilled in the art that the combination of Maeda in view of Date in 
view of Ichimura would be integrating audio data with respect to the first 
encoding unit with image when importance is not given consideration for 
maintaining high quality for important sections as opposed to unimportant 
sections (See Ichimura, col. 5, lines 40-46). Further, in regards to the latter 
limitation of the data integration step, Date teaches the use of multiple encoders 
and for detecting and compressing based on type of signals (see Date, col. 6, 
lines 31-47), while Ichimura teaches the compression analysis based on 
important sections and if an important section is detected then performing 
compression to ensure high quality ((see col. 5, lines 40-46). The data integration 
occurs as the image has a corresponding audio content that when compression 
takes place, both the audio and image data are accounted for (see col. 5, lines 
40-46 and Maeda, Figure 24, and [0202]). 

It would have been obvious to one of ordinary skilled in at the time the 
invention was made to have modified the image processing taught by Maeda in 
view of Date with the encoding of high quality audio in selected regions of an 
image as taught by Ichimura. The motivation to have combined the two 
references involves the ability to produce accurate playback for important regions 
in a signal (i.e. ROI) (see Ichimura, col. 5, lines 5-8). 
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As to claim 1 9, Maeda in view of Date in view of Ichimura teach all of the 
limitations as in claim 10 and further teaches the use of a storage medium (see 
[0302], storage medium). 



7. Claims 21 and 22 are rejected under 35 U.S.C. 103(a) as being unpatentable 
over Maeda (US PGPub 2001/0048770, 12/06/2001) in view of Date (US 6,605,768) in 
view of Ichimura (US 6,188, 831) as applied to claims 1 and 10 above, and further in 
view of Iseda (US 5,091 ,955). 

Iseda does teach selection of encoded audio data having higher quality by 
comparing (a) audio quality of the encoded audio data encoded by said first 
audio encoding unit and (b) audio quality of the encoded audio data encoded by 
said second audio encoding unit, and integrates the selected encoded audio data 
with the encoded image data (see col. 4, lines 56-65, selection done based on 
which coder set has optimum quality). 

It would have been obvious to one of ordinary skilled in at the time the 
invention was made to have modified the image processing taught by Maeda in 
view of Date in view of Ichimura with the selection of an optimal coder as taught 
by Iseda. The motivation to have combined the two references involves the ability 
to produce high quality audio (see Iseda, col. 2, lines 40-15 and 50-55) as would 
benefit the image processing apparatus taught by Maeda, which discloses image 
and audio data corresponding to the ROI. Further, the combination would allow 
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the system taught by Maeda to produce high quality audio for the ROI 
representing the audio and image. 



Conclusion 

8. Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to PARAS SHAH whose telephone number is (571)270- 
1650. The examiner can normally be reached on MON.-THURS. 7:00a. m.-4:00p.m. 
EST. 

If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, Patrick Edouard can be reached on (571)272-7603. The fax phone number 
for the organization where this application or proceeding is assigned is 571-273-8300. 

Information regarding the status of an application may be obtained from the 
Patent Application Information Retrieval (PAIR) system. Status information for 
published applications may be obtained from either Private PAIR or Public PAIR. 
Status information for unpublished applications is available through Private PAIR only. 
For more information about the PAIR system, see http://pair-direct.uspto.gov. Should 
you have questions on access to the Private PAIR system, contact the Electronic 
Business Center (EBC) at 866-217-9197 (toll-free). If you would like assistance from a 
USPTO Customer Service Representative or access to the automated information 
system, call 800-786-9199 (IN USA OR CANADA) or 571-272-1000. 



/Paras Shah/ 
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